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Abstract: Fluorescent 2'-deoxynucleoti- 
des containing a protecting group at 
the 3'-0-position are reversible termi- 
nators that enable array-based DNA 
sequencing-by-synthesis (SBS) ap- 
proaches. Herein, we describe the syn- 
thesis and full characterisation of four 
reversible terminators bearing a 3'- 
blocking moiety and a linker-dye 
system that is removable under the 
same fluoride-based treatment. Each 



nucleotide analogue has a different flu- 
orophore attached to the base through 
a fluoride-cleavable linker and a 2-cya- 
noethyl moiety as the 3'-blocking 
group, which can be removed by using 
a fluoride treatment as well. Further- 
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more, we identified a DNA poly- 
merase, namely, RevertAid M-MuLV 
reverse transcriptase, which can incor- 
porate the four modified reversible ter- 
minators. The synthesised nucleotides 
and the optimised DNA polymerase 
were used on CodeLink shdes spotted 
with hairpin oligonucleotides to dem- 
onstrate their potential in a cyclic re- 
versible terminating approach. 



Introduction 

The completion of the two Human Genome Projects in 2001 
implemented the ultimate goal of sequencing the whole 
human genome. ^^'^^ This milestone not only provided a refer- 
ence genome, but also unexpected opportunities, such as 
new questions, goals, and hopes, increased demands for im- 
provements in the cost efficiency and throughput of DNA 
sequencing to an astonishing extent. These requirements not 
only address sequencing of whole novel genomes, but also 
the search for individual variation within the human 
genome. In fact, the latter is a very important issue of the 
postgenome era, as it promises to elucidate how genetic var- 
iation interacts with the environment to confer individual 
resistance or susceptibility to disease, success of medical in- 
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terventions, and drug response. For this reason, there has 
been a rapid development in genotyping technology. For ex- 
ample, many systems for detecting mutations or single nu- 
cleotide polymorphisms (SNPs) on a large-scale are current- 
ly commercially available. 

One genotyping technology, arrayed primer extension 
(APEX), is a minisequencing microarray assay capable of 
detecting different types of genetic variations, while combin- 
ing the efficiency of microarrays (an alternative to gel-based 
methods) and Sanger sequencing technology. ^"^'^^ In general, 
this method can be viewed as DNA sequencing by termina- 
tion with the use of labelled dideoxynucleotides in a DNA 
polymerase reaction. However, there is one important dif- 
ference between APEX and conventional Sanger sequenc- 
ing. Instead of using one primer and analysing hundreds of 
extension products with polyacrylamide gel electrophoresis 
(PAGE), hundreds to thousands of primers are spatially sep- 
arated beforehand as a two-dimensional array of oligonucle- 
otides. These primers are immobilised by attaching the 5'- 
end to a glass surface, and each oligonucleotide is extended 
at the 3'-end by only one dye-labelled dideoxynucleotide 
complementary to the nucleotide at the variable site. As a 
result, each primer identifies one base in the target se- 
quence. The advantages of APEX include parallel analysis 
of hundreds to thousands of genetic variations in a single re- 
action, high allelic discrimination by the use of a DNA poly- 
merase and four labelled terminators, and the possibility of 
locus- or disease-specific array design. Therefore, all four 
possible sequence variants can be detected simultaneously 
in one reaction. One disadvantage is that one primer is nec- 
essary for each position to be identified. This approach can 
lead to a very high number of ohgonucleotides on the array, 
which can cause problems in fluorescence detection. De- 
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pending on the region of DNA that has to be sequenced, it 
could even exceed the scope of the array. 

We have designed and synthesised a new generation of 
fluorescently labelled, reversibly terminating nucleotides, 
identified a DNA polymerase that accepts these nucleotides, 
and optimised the reaction conditions under which they are 
incorporated into the DNA fragments. The use of these re- 
versible terminators has the potential to fulfil the needs of 
repeated primer extension reactions on APEX DNA arrays. 
In this approach, one cycle consists of three steps: 1) DNA 
polymerase-mediated incorporation of the complementary 
reversible terminator onto the immobilised oligonucleotide 
primer sequence, 2) detection of the fluorescence signals 
specific for each of the four bases, and 3) cleavage of the ter- 
minating moiety and the reporter molecule to restore the 
free 3'-OH group and remove fluorescence signals of al- 
ready incorporated nucleotides. Repetition of this cycle 
leads to the template sequence. This approach, which uses 
our reversible terminators together with the DNA poly- 
merase, provides an opportunity to revolutionise the future 
of APEX technology because the number of oligonucleotide 
features on the array could be decreased according to the 
number of consecutive primer extension cycles on the chip 
surface. 

This idea of using 3'-reversibly blocked nucleotides for se- 
quencing was proposed in the beginning of the 1990s. The 
first examples of potential reversible terminators were re- 
ported in 1994 by Metzker and co-workers^^^ and Canard 
and Sarfati.^^^ However, the demands these molecules have 
to meet are challenging, and therefore implementation is 
difficult. The general structure of a reversible terminator 1 
is shown in Scheme 1. 



^nveT,Dy«A,DyeC,DyeG 

cleavable 
linker 



reversibly terminating moiety 
1 

Scheme 1. General structure 1 of a fluorescently labelled reversible ter- 
minator. 



The structural requirements for these nucleotides include 
a reversibly terminating moiety at the 3'-position and a re- 
porter molecule, such as a dye, attached to the base by a 
cleavable linker. In the design of a suitable reversible termi- 
nator, several important issues have to be considered: First- 
ly, the 3'-blocking group has to be stable during the poly- 
merase-mediated extension step to ensure effective abortion 
of elongation after incorporating a single nucleotide. Sec- 
ondly, a cleavable linker has to be designed to attach the re- 
porter moiety to the base. It is disadvantageous to combine 
the reporter and blocking group at the 3'-position. Welch 
and Burgess reported the lack of acceptance of bulky 3'- 
modifications by DNA polymerases, which was also con- 



firmed by a crystal-structure study of a rat DNA/primer/nu- 
cleotide complex. Thus, the linker has to be cleavable 
under conditions that match the cleavage of the 3'-blocking 
group to allow both the regeneration of the 3'-OH group 
and the removal of the linker-dye system in a single depro- 
tection step. Thirdly, the cleavage of the reversibly terminat- 
ing group and the linker should be quantitative without af- 
fecting the DNA-template stability. Fourthly, a polymerase 
is needed that accepts the 3'-modification and nucleotide 
modifications and still discriminates strictly between the 
four bases during the incorporation reaction. Whereas 
Sanger sequencing has proven that modifications at the 7- 
position of 7-deazapurines and the 5-position of pyrimidines 
are well tolerated, the choice of a suitable 3'-modification 
seems more difficult because the editing properties of poly- 
merases must also be considered. During the last several 
years, academic and industrial research groups have focused 
on the design of such reversible terminators. Within the 
scope of this study, several 3'-blocking groups were investi- 
gated, including bulky esters^^^ and ethers^^^ with the label at- 
tached to the blocking group and small groups. Some exam- 
ples are the 3'-0-(2-nitrobenzyl) group investigated by 
Metzker and co-workers^^^ and Welch and Burgess, ^^'^^^ the 
3'-(9-allyl group reported by Metzker, Ju,^^^^ and Kim,^^^^ or 
the 3'-(9-azidomethyl group, which was used by Ju and co- 
workers^^"^'^^^ and was also realised in a commercially avail- 
able device, the Genome Analyzer developed by Illumina/ 
Solexa.f^^'^^^ Other interesting groups are the 3'-0-NH2 
group from Benner and co-workers,^^^^ the 3'-0-(2-cyanoe- 
thoxy)methyl group reported by us,^^^^ or some 3'-blocking 
groups removable under mild reducing or mild acidic condi- 
tions reported by Kwiatkowski.^^^^ The terminators with 
bulky 3'-modifications exhibited problems with polymerase 
acceptance. Ester and carbonate linkages are easily cleaved 
by polymerases, thus leading to multiple incorporation 
events. Some cases in which the reporter is attached to the 
base by a cleavable linker, the cleavage conditions of the 
linker differ from those of the 3'-blocking group. Two differ- 
ent kinds of chemical treatments make these strategies more 
time consuming, thus showing that this research is very chal- 
lenging and encourages further investigation of new candi- 
dates for reversibly terminating groups. 

Herein, we present the synthesis of a complete set of four 
reversible terminators that bear the fluoride-cleavable 3'-(9- 
(2-cyanoethyl) group as a 3'-OH blocking moiety^^^^ and a 
suitable fluoride-cleavable linker^^^^ to connect the nucleo- 
side to a fluorescent dye. The polymerase incorporation ex- 
periments and first applications of the reversible terminators 
in an APEX system are also presented. 



Results and Discussion 

Synthesis of the four dye-labelled reversible terminators: 

Our target molecules are modified at the 3'-position with 
the fluoride-cleavable 2-cyanoethyl group^^^^ and at the base 
with our recently reported fluoride-cleavable linker 
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Scheme 2. Structures of the fluorescently labelled reversible terminators 2-5. 



(Scheme 2)P^^ The four fluorescent dyes chosen for labelhng 
the four reversible terminators are 5- and 6-carboxyfluores- 
cein for thymidine, cyanine 3.0 (Cy 3.0) for 2'-deoxycytidine, 
5- and 6-carboxy-X-rhodamine for 2'-deoxyadenosine, and 
cyanine 5.0 (Cy 5.0) for 2'-deoxyguanosine. These dyes were 
chosen as they are spectrally well separated from each 
other, well known, and commercially available. In addition, 
the same fluorescent dyes have been already used as di- 
deoxy terminator conjugates for a couple of years in classi- 
cal APEX reactions on a Genorama platform for SNP geno- 
typing,^^^^ mutation detection,^^'^^ and APEX-based rese- 
quencing.f^^^ 

To synthesise the four reversible terminators, there are 
two crucial steps: 1) introduction of the 3'-OH modification 
and 2) introduction of the linker-dye system. The linker was 
incorporated by the well-known Sonogashira cross-coupling 
reaction.f^^'^^^ As a prerequisite for this method of attach- 
ment, the four iodonucleosides (i.e., 5-iodo-2'-deoxyuridine 
(6), 5-iodo-2'-deoxycytidine (7), 7-deaza-7-iodo-2'-deoxyade- 
nosine (8), and 7-daza-7-iodo-2'-deoxyguanosine (9; shown 
in Scheme 3) are required starting compounds. Compounds 
6 and 9 were purchased from commercial sources, whereas 7 
and 8 were synthesised in our laboratory by combining steps 
from reported procedures that are not described in detail 
herein.P^-^2] 

For selective introduction of the 2-cyanoethyl group at 
the 3'-position, it is necessary to protect all the other func- 
tional groups. Saneyoshi et al. described the protection of 
the 2'-OH group with the 2-cyanoethyl group of all four 
RNA nucleosides with acrylonitrile in a Michael addition re- 
action with CS2CO3 as a heterogeneous base in ^BuOH.^^^^ 
For DNA, the modification of the 3'-OH group with the 2- 
cyanoethyl group has been reported by us for thymidine and 
5 [21,22] Pqj. ^j^^ other three nucleosides 7-9, we developed 
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Scheme 3. The four starting compounds 6-9. 



protecting-group strategies to enable the selective introduc- 
tion of the 3'-(2-cyanoethyl) group (Schemes 4 and 5). 

We used base-labile protecting groups for the amino func- 
tions and 5'-OH group. The synthesis of the 3'-(9-(2-cya- 
noethyl)-modified 2'-deoxyuridine analogue 10 was accom- 
phshed, as recently reported by us.^^^^ The same strategy was 
used for the protection of the functional groups of 7 and 8. 
The exocyclic amino groups of both nucleosides were pro- 
tected with formamidine. No acidic proton should be left to 
prevent the reaction of its conjugated base in the Michael 
addition. Both reactions were carried out following a proce- 
dure described for 5-iodo-2'-deoxycytidine.^^'^^ The formami- 
dino-protected 2'-deoxycytidine analogue 11 was obtained in 
a moderate yield of 62%. Whereas the previously reported 
yield^^^^ for the 2'-deoxyadenosine analogue 14 could be en- 
hanced from 85 to 95%. By carrying out the 5'-selective 
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Scheme 4. Protecting-group strategy and introduction of the 3'-modifica- 
tion into 6p^^ 7, and 8. Reagents and conditions : a) A^,A^-dimethylforma- 
mide dimethylacetal, dry DMF, 55 °C, 2.5 h; b) BzCl, dry pyridine/dry 
DMF = 4:1, 0°C^rt, 2h; c) acrylonitrile, CS2CO3, ?BuOH/dry DMF 2:1, 
rt, 3h; d) saturated methanolic ammonia, rt, 2.5 h; e) A^,A^-dimethylfor- 
mamide dimethyl acetal, dry DMF, 50 °C, 2h; f) BzCl, dry CH2CI2, dry 
pyridine, -15 °C, 1 h; g) acrylonitrile, CS2CO3, fBuOH, rt, 2h; h) satu- 
rated methanolic ammonia, 50 °C, 2 h. Bz = benzoyl, BzCl = benzoyl chlo- 
ride. 



benzoylation of 11 at 0°C at room temperature, 12 was pro- 
duced in a moderate yield of 60%. The 2'-deoxyadenosine 
analogue 15 could be obtained in excellent yield by carrying 
out the reaction at a low temperature (— 15°C). The Michael 
addition was accomphshed by following procedures reported 
by Saneyoshi et alP^^ and by our group J^^'^^^ Freshly distilled 
acrylonitrile was used in ^BuOH as the solvent and CS2CO3 
as the base. One modification of the procedure was necessa- 
ry for 12; in this case, DMF was used as a cosolvent to en- 
hance the solubility of the starting material. After purifica- 
tion, the fully protected intermediate was isolated in 88% 
yield. The fully protected 3'-modified 7-deaza-7-iodo-2'-de- 
oxyadenosine analogue was obtained in a quantitative yield. 
The deprotection reactions that lead to compounds 13 and 
16 were carried out in saturated methanolic ammonia to 
result in yields of 86 and 96% of 13 and 16, respectively. 
The first three 3'-modified key compounds, namely, 10, 13 



and 16, could be obtained by using very similar protecting- 
group strategies, which effectively enabled the selective 
modification of the 3'-OH group. Another protecting group 
strategy had to be applied for the fourth nucleoside 9 be- 
cause guanosine analogues have an additional functional 
group that has to be protected. The developed synthetic pro- 
cedure was optimised first by using natural 2'-deoxyguano- 
sine (not shown) and then transferred to the 7-deaza-7-iodo 
derivative 9. The synthesis of the 3'-0-(2-cyanoethyl) modi- 
fied derivative 21 is shown in Scheme 5. 



o 




Scheme 5. Protecting-group strategy and introduction of the 3'-modifica- 
tion into 9. Reagents and conditions: a) 1,1,3,3-tetraisopropyldichlorodisi- 
loxane, dry pyridine, 0°C^rt, 1 h; b) N,A^-dimethylformamide dimethyl- 
acetal, dry DMF, rt, 24 h; c) BzCl, dry pyridine, dry CH2CI2, 0°C^rt, 
2h; d)Et3N-3HF, THF, rt, 1 h; e) MMTrCl, DMAP, dry pyridine, rt, 
18 h; f) acrylonitrile, CS2CO3, ?BuOH, rt, 2h; g) PTSA, CH2Cl2/EtOH 
1:1, rt, 1 h; h) 32% aqueous NH3, MeOH, rt, 18 h. DMAP = A^,A^-dime- 
thylaminopyridine, MMTr = monomethoxytrityl, PTSA = para-toluenesul- 
fonic acid. 



For the protection of the atom, we chose benzoyl as a 
blocking group during the synthesis of the 2'-deoxyguano- 
sine analogue 21. For the intermediate protection of the 3'- 
and 5'-OH groups, we used the Markiewicz procedure. ^^^^ 
For the protection of the exocyclic amino group, the forma- 
midino group was used. These two steps were carried out 
without an intermediate purification step, thus furnishing 17 
in 95%. The subsequent benzoylation of the atom was 
carried out between 0°C and room temperature and pro- 
ceeded in 69 % yield. When followed by deprotection of the 
5'- and 3'-OH groups with Et3N-3HF in THF, the amino- 
protected compound 18 could be isolated in 97% yield. 
During the synthesis of the 2'-deoxyguanosine analogue 21, 
we deviated from the exclusively base-labile protecting- 
group strategy. To improve the solubility properties for the 
Michael addition reaction, we used the MMTr group to pro- 
tect the 5'-OH group of 18 to obtain fully-protected 19 in 
71 % yield. The 3'-modification was accomplished by follow- 
ing the same procedure employed for the 2'-deoxyuridine 
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analogue and the 2'-deoxyadenosine analogue 15, thus pro- 
viding fully protected 20 in an excellent yield of 88 % . After 
deprotection, first with PTSA for removal of the MMTr 
group followed by aqueous ammonia for cleavage of the 
amino protecting groups, the fourth key compound 21 could 
be obtained in 62 % yield over the last two steps. 

Recently, we described the synthesis of the 5- and 3'- 
modified 2'-deoxyuridine analogue 24S^^^ We coupled 5-io- 
donucleoside 10 to 1,1,1-trifluoroacetyl-protected propargyl- 
amine and, after subsequent deprotection, attached the 
linker to the free amino group through the A^-hydroxysucci- 
nimidyl carbonate group. Therein, we describe an improve- 
ment of this procedure by synthesising the linker derivative 
23 in a one -pot procedure (Scheme 6). 
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Scheme 6. Synthesis of the linker derivative 23. Reagents and conditions : 
a) l.DSC, anhydrous K2CO3, dry CH3CN, 0°C, 20 h; 2. propargylamine, 
KHCO3, 0°C ^rt, 5h. Boc = rerr-butoxycarbonyl, DSC = A^,A^'-hydroxy- 
succinimidyl carbonate. 



Compound 23 enables a direct Sonogashira coupling to 
the 3'-modified iodonucleosides 10, 13, 16 and 21 
(Scheme 7). This route saves two synthetic steps, and the 
yield of 24 could be improved from 49 % over three steps to 
80 % in one step. 
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Scheme 7. Linker attachment and triphosphate synthesis with all four 3'-0-(2-cyanoethyl) modified nucleosides 
10, 13, 16, and 21. Reagents and conditions: a) dry DMF, triethylamine, Cul (0.2 equiv), [Pd(PPh3)4] 
(0.1 equiv), alkyne 23 (1.2-2.0 equiv), rt, 3.5-5 h; b) 1. dry pyridine/dry dioxane (1:30, 2-chloro-4//-l,2,3-benzo- 
dioxaphosphorin-4-one, rt, 15min; 2. dry DMF, bis(tributylammonium)pyrophosphate, tributylamine, rt, 
20 min; 2. iodine in pyridine/water (2%, 98:2), rt, 20 min; 4. 5% NazSOg solution; c) TFA (40 equiv) in water, 
rt, 5-6 h. TFA = trifluoroacetic acid. 



The Sonogashira coupling was applied to all four nucleo- 
sides 10, 13, 16 and 21 under the same conditions (i.e., a 
Pd°/Cu^ catalyst system, DMF, EtgN, rt). The desired deriva- 
tives were obtained in good-to-very-good yields and with 
very high purities. The triphosphate synthesis was accom- 
plished by following the procedure of Ludwig and Eckstein 
reported in 1989.^^^^ In this step, 2-chloro-4//-l,2,3-benzo- 
dioxaphosphorin-4-one is used as the phosphorylating re- 
agent that is subsequently opened with pyrophosphate and 
finally oxidised with a 2 % iodine solution in pyridine/water. 
In general, the reaction proceeds well, as demonstrated by 
analysing the crude reaction product by ^^P NMR spectro- 
scopic analysis (not shown). These crude products were 
treated directly with TFA/water to remove the Boc protect- 
ing group from the amino function of the linker moiety. The 
compounds were obtained in yields of around 20% for 28, 
30 and 31 and only 8 % for 29 due to the multistep purifica- 
tion of these compounds, which is crucial after this step. 
Thus, two purification steps had to be performed to achieve 
the necessary purity. Anion-exchange chromatography at 
4°C on a fast-protein liquid chromatography (FPLC) system 
was the first step followed by reverse phase (RP)-HPLC to 
yield 28-31 with a high degree of purity in quantities be- 
tween 22 and 120 mg (31 and 28, respectively). The obtained 
triphosphate derivatives produced positive results in the fol- 
lowing labelling reactions with the selected fluorescent dyes. 
All the dyes (i.e., 5- and 6-carboxyfluorescein, 5- and 6-car- 
boxy-X-rhodamine, Cy 3.0, and Cy 5.0) were used in the ac- 
tivated A^-hydroxysuccinimidyl ester form. The coupling re- 
actions were carried out in dry DMF with KHCO3 as a het- 
erogeneous base in the case of 28 and A/,A^-diisopropylethyl- 

amine (DIPEA) as a base for 
the other nucleotides 29-31. 
The purification of the final 
compound was also very labori- 
ous; prepurification on the RP- 
FPLC system with self-packed 
columns was necessary to 
remove the starting-material 
dye before final purification 
with RP-HPLC was possible. 
After this procedure, the four 
reversible terminators 2-5 
(Scheme 2) were obtained with 
a high degree of purity in quan- 
tities between 1.5 and 6 mg (for 
5 and 2, respectively) and con- 
firmed by and ^^PNMR 
spectroscopic and mass-spectro- 
metric analysis. The ^^P NMR 
spectra of all four compounds 
prove the structure of the ob- 
tained compounds (Figure 1). 

The four labelled reversible 
terminators 2-5 were used in 
polymerase incorporation 
assays and in the demonstration 
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of a proof-of-principle for reversible primer extension on 
immobilised template oligonucleotides. 

Polymerase identification for primer extension: Several as- 
pects of the system appear to be critical to incorporate re- 
versible terminators enzymatically into the nucleic acid sub- 
strate. The polymerase must have the proper affinity and se- 
lectivity towards modified nucleotides along with adequate 
turnover rate and propensity to form terminated DNA mol- 
ecules that remain stable until the 3'-modifications are re- 
moved by chemical means. In turn, the reversible terminator 
itself must display an appropriate balance between chemical 
stability during enzymatic incorporation and the ability to 
be unblocked under conditions mild enough for the DNA 
structure to be retained for further enzymatic treatment. A 
solution-based DNA primer extension system that is suita- 
ble for simultaneous evaluation of substrate extension by 
either single or multiple T residue(s) was used for initial 
screening of a wide range of polymerases that represent 
major famihes of DNA polymerases (A, B, X, Y, and re- 
verse transcriptases) and some of their mutants. This system 
enables the gel-based detection of primer extension prod- 
ucts with single nucleotide resolution. By following the 
primer-extension step, natural thymine triphosphate (dTTP) 
was added to check the stability of the termination of the re- 
sulting -hi product. Polymerases that exhibited even mini- 
mal proofreading activity were found to be incompatible 
with the assay due to primer truncation (data not shown). 
We found 3'-0-(2-cyanoethyl)-dTTPf^^^ to be incorporated 
into DNA (Figure 2 A) by several reverse transcriptases 
only (Figure 2B). All enzymes possess the ability to incorpo- 
rate the compound (see even-numbered lanes for bands that 
appear just above the primer), although the efficiency of in- 
corporation varies. RevertAid M-MuLV reverse transcrip- 
tase yielded a single primer-extension product (Figure 2 B, 
lane 4) that was resistant towards extension by supplement- 
ed dTTP (Figure 2B, lane 5) due to its 3'-terminal modifica- 
tion. Optimisation of the reaction conditions was necessary 
due to the low efficiency of the -h 1 extension (reaction time 
was 60 min) and was accomplished by including 1 mM MnCl2 
into the reaction mixture. The reaction rate increased signif- 
icantly, thus driving the primer-extension reaction to com- 
pletion in just 15 min (Figure 2C). Incorporation of the dye- 
conjugated 3'-modified compounds 2-5 under identical con- 
ditions is of comparable speed. Figure 2D presents the pro- 
files of the primer extension reactions that involve com- 
pounds 2-5 after 5 min. 

The specificity of primer extension was addressed by per- 
forming the reaction under multiplex conditions, in which 
four DNA substrates of different length, each competent for 
extension by a particular nucleotide only (Figure 3 A), were 
simultaneously present in the reaction mixture. Incorpora- 
tion of the individual 3'-modified nucleotides (Figure 3 B, 



Figure 1. ^^P NMR spectra of the four dye -labelled reversible terminators 
2-5. 
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Figure 2. The use of 3'-0-(2-cyanoethyl)-dNTP derivatives for primer ex- 
tension by reverse transcriptases. A) DNA substrate used (* indicates the 
radiolabelled 5' -terminus). The position in the template strand that di- 
rects the nucleotide to be incorporated is in bold type. B) Incorporation 
of 3'-0-(2-cyanoethyl)-dTTR Lanes 1: starting-material primer, poly- 
merases; 2 and 3: M-MuLV; 4 and 5: RevertAid M-MuLV; 6 and 7: Re- 
vertAid H minus M-MuLV; 8 and 9: AMV (cloned); 10 and 11: Thermo- 
Script; 12 and 13: Superscript III; 14 and 15: Transcriptor. dTTP was 
added into the reaction mix after the primer extension step, as indicated 
for lanes 3, 5, 7, 9, 11, 13, and 15. C) Time course of incorporation of 3'- 
0-(2-cyanoethyl)-dTTP by RevertAid M-MuLV reverse transctiptase in 
the presence of Mn^"*" ions. D) Extension of the corresponding substrates 
by the dye-labelled 3'-0-(2-cyanoethyl)-modified nucleotides 2-5 during 
a reaction time of 5 min. 



lanes 2, 4, 6 and 8) or a mixture of all four (Figure 3 B, lane 
10) demonstrated the performance of the system under con- 
ditions typical for sequencing reactions. DNA substrates are 
extended by relevant 3'-modified nucleotides only and 
remain unaffected in presence of natural 2'-deoxynucleotide 
triphosphates (Figure 3 B, lanes 3, 5, 7, 9 and 11). At this 
stage, a more detailed assessment of polymerase specificity 
was not performed due to differences between the condi- 
tions for the hquid-state reactions and the ones involving 
the microchip surface, typical for APEX. 

Extension of immobilised DNA templates: The complete 
system was tested in a reversibly terminating approach by 
using the set of our four fluorescently labelled reversible ter- 
minators 2-5, the RevertAid M-MuLV reverse transcriptase, 
and the reaction buffer optimised for primer extension. 

To test the labelled reversible terminators 2-5 for their in- 
corporation into immobilised DNA, a set of 16 oligonucleo- 
tides that differ at positions Q and Z was designed to pro- 
vide a readout of all possible variants of two consecutive nu- 
cleotides at X and Y positions (Figure 4 A). These primers 
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have an internal self-complementary region that allows 
folding into a hairpin structure, thus providing a double- 
stranded 3'-terminus for extension by the polymerase. All 
the oligonucleotides were functionalised with a 5'-C6 
amino modification to enable their immobilisation onto 
the amino-reactive array surface. We chose CodeLink-acti- 
vated slides because their glass surface is coated by a syn- 
thetic material that is compatible with our deprotection 
conditions (fluoride ions) in contrast to glass slides that 
would be harmed by a prolonged fluoride treatment (data 
not shown). The spotting was accomplished by using a 
10 |LiM solution of each oligonucleotide in 100 mM carbon- 
ate buffer (pH 9.0) with the Versarray Chip writer Pro Ar- 
rayer. The spotting layout of the CodeLink shdes is shown 
in Figure 4B and is arranged so that the first primer-exten- 
sion event is conveniently followed by signals that appear 
vertically (Figure 4 B, red letters) and the second event 
represented by signals that appear horizontally (blue let- 
ters). The primer layout shown in Figure 4 was spotted 
twice on each shde (array). The composition of the reac- 
tion mixture for the extension of immobilised primer was 



S'-'I^GCAGACAGCGATGGGTAGGGAK^^ 

3'- AOQTCTOTGGCTACCXSATCCGTAGGAACCGCATCCCGGG^ 

S'-'tGCAGACATGGQTAGGGATGGTTGGOGTA 
3'- AOQTCIOTAGCGATOCOTAGGMAC^^ 

S'-'tGCATQGCn'AGGCATCCTTGGCGTA 

3'- ACQTACCCATCXX3TAGGAACCGCATAA(X3GGCGGCG 

5'-'tGK3GrrAGGCAT€CTTGGC(3TA 

3'- ACCCATCCGTAGGAACCGCATGGCGGGCGGCG 
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Figure 3. Specific primer extension with 3'-0-(2-cyanoethyl)-dNTP deriv- 
atives by RevertAid M-MuLV reverse transctiptase. A) Set of DNA sub- 
strates used for specificity studies (* indicates radiolabelled 5'-termini). 
The position in the template strand that directs the nucleotide to be in- 
corporated is in bold type. B) The initial primer-extension step (lanes 2, 
4, 6, 8, and 10) was followed by the addition of a dNTP mixture to the so- 
lution containing all four natural dNTPs dissolved in water and incuba- 
tion for 5 min (lanes 3, 5, 7, 9, and 11). Lanes 1 and 12: the starting-mate- 
rial primer set; 2 and 3: 3'-0-(2-cyanoethyl)-dGTP; 4 and 5: 3'-0-(2-cya- 
noethyl)-dATP; 6 and 7: 3'-0-(2-cyanoethyl)-dTTP; 8 and 9: 3'-0-(2-cya- 
noethyl)-dCTP; 10 and 11: full set of 3'-0-(2-cyanoethyl)-dNTP nucleo- 
tides. 
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Figure 4. Schematic illustration of the hairpin primers and the spotting 
layout of the slides. A) The variable positions in the template part are 
marked as Z for the first template nucleotide and as Q for the second. 
The variable positions immediately after the 3'-terminus are marked as X 
and Y for the first and second nucleotide to be incorporated, respectively. 
B) Each of the 16 oligonucleotides is applied on the slide in four spots ar- 
ranged on the grid in squares. In a column (vertically), all the four 
squares encode the same base for the first incorporation event (X). Each 
square within a row (horizontally) encodes one of the four nucleotides 
for the second incorporation event (Y). 



arrays spotted on the slide. The specificity of a particular 
fluorescent-nucleotide incorporation into the first position X 
of the primers was calculated by combining the signals from 
four primers located within the same row and that bear the 
same nucleotide at the Q position (Figure 4 A) and by evalu- 
ating the percentile of detected fluorescence within each of 
the four primers. This approach enables the evaluation of 
the polymerase specificity within the same sequence context. 
The imaging results obtained after the first step showed that 
all four reversible terminators were incorporated with high 
selectivity (as summarised in Figure 5). We calculated that 
the specificity of the incorporation of 5 (channel read G) 
into primer GG reached 94.2% and the highest nonspecific 
incorporation of 5 is 5.1% of the total signal into primer 
AG. Incorporation of 2 (channel read T) was the most spe- 
cific and varied from 96.8 to 99.6%; lower specificity was 
observed for 3 and 4 (channel read A and C; 74.2-89.3 and 
80.5-89.5%, respectively). 

Preliminary experiments revealed that a fraction of the 
spotted oligonucleotides remains unextended by reverse 
transcriptase during the first elongation step (data not 
shown), thus resulting in the appearance of a high back- 
ground during the second elongation step. The terminal de- 
oxynucleotide transferase (TdT) catalyses a template-inde- 
pendent addition of 2'-deoxyribonucleotides to the 3'-OH 



further improved by the addi- 
tion of bovine serum albumin 
(BSA), a reported compound 
to counteract protein adsorp- 
tion and inactivation on wafer 
surfaces. ^^^^ Furthermore, nu- 
merous osmolytes were 
screened and 2-hydroxyme- 
thyl[18]crown-6 was determined 
to be the most efficient enhanc- 
er of primer extension. Evalua- 
tion of spotted CodeLink slides 
for the acceptance of all four 
fluorescently labelled reversible 
terminators 2-5 confirmed the 
time-dependent character of 
the primer extension. The maxi- 
mum signal was obtained after 
a reaction time of 30 min (not 
shown). This time-point was 
used in the reversible primer- 
extension experiment 
(Figure 5). 

Figure 5 depicts the results of 
the fluorescence detection after 
the first and second elongation 
steps in which each picture rep- 
resents a scan of one channel. 
The tables in the middle of 
Figure 5 show the specificity 
values of the first elongation 
step estimated from the two 
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Figure 5. Imaging for illustration and analysis of the primer-extension events. Signals of the first elongation 
appear vertically (in column), signals of the second elongation appear horizontally (in rows). Evaluation of the 
specificity of the primer extension is given in the tables, and the target nucleotide is in bold type. Fluorescence 
readout channels are shaded and present the data in columns (vertically) for the first extension step. Signals in 
circles are artefacts. 
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group of various DNA and RNA substrates and efficiently 
incorporates the polymerisation terminator 2',3'-didesoxy- 
adenosine-5'-triphosphate (ddATP). On the other hand, we 
found that TdT does not extend DNA substrates that bear 
the 3'-0-(2-cyanoethyl) protecting group (results not 
shown). Therefore, the first elongation step was followed by 
incubation of the spotted slides with TdT in the presence of 
ddATP. During this treatment, it was expected that unla- 
belled (and uncleavable) 3'-dideoxy terminators would be 
incorporated into the structure of previously unextended 
primers and prevent them from being extended during the 
second elongation step. Afterwards, the slide was imaged 
and treated with a 1:1 solution of tetrabutylammonium fluo- 
ride (TBAF; 1m)/THF and DMF at 45 °C for 15 min to 
cleave the fluorescent dye and 3'-0-(2-cyanoethyl) group. 
To ensure that the linker-dye moiety was completely re- 
moved, this process was controlled by a separate imaging 
step in which only the background-level fluorescence signal 
could be detected (not shown). 

On these deprotected slides, the second round of primer 
extension was executed under conditions identical to the 
first round. The pictures of fluorescence detection after the 
second elongation are displayed in Figure 5 as well. During 
this step, the same nucleotide is expected to be incorporated 
into primers positioned in the same row (horizontally). 
These pictures demonstrate the feasibihty of the second 
primer extension because the spots are detected at expected 
positions. However, the efficiency of the incorporation of 
the individual fluorescently labelled reversible terminators 
2-5 was 15-30 times lower relative to that observed during 
the first elongation step. The selectivity suffered mainly 
from incorporation into those primers that were substrates 
during the first elongation step. The latter phenomenon is 
manifested by the appearance of additional signals in col- 
umns (images on the right part of Figure 5) the positions of 
which coincide with those labelled during the first elonga- 
tion step (images on the left part of Figure 5). Several fac- 
tors might account for this discrepancy during the primer ex- 
tension on immobilised templates. During the experiments 
on the shdes, the efficiency of the unblocking process was 
monitored by measuring the remnant fluorescence and ex- 
pecting the 3'-0-(2-cyanoethyl) group to be removed as effi- 
ciently as the linker-dye system. This outcome was true for 
chemical unblocking in solution or on a controUed-pore 
glass (CPG) -immobilised oligonucleotide,^^^^ but not necessa- 
rily for hairpin-forming oligonucleotides immobilised on a 
slide. Therefore, a probable reason for the decreased incor- 
poration efficiency during the second elongation step could 
be the nonquantitative chemical deprotection of the 3'-OH 
group. On the other hand, we observed that the enzyme we 
used extends a fraction rather than all spotted oligonucleo- 
tides, and this feature should also be responsible for the par- 
tial decrease in the signal intensity after the second elonga- 
tion step. The apparently less-selective incorporation of 
modified nucleotides 2-5 during the second elongation step 
appears to be related to a fraction of substrates that remain 
unblocked during the first elongation step and subsequent 



TdT treatment (Figure 5, compare the images on the left 
and right). However, this finding is true for the incorpora- 
tion of all the modified nucleotides, except for compound 3 
(Figure 5 ; see the channel read A) in which no increased 
fluorescence was observed after the second elongation step 
in column A. On the basis of this observation, it might be 
speculated that ddATP incorporation by TdT was the most 
complete when the complementary T nucleobase was pres- 
ent in the template strand (all four primers of such structure 
are located within the A column). On the other hand, unex- 
tended primers might become available not only if treat- 
ment by TdT is incomplete, but also as a result of pyrophos- 
phorolytic dismutation.^^^^ One also should consider that all 
four nucleotides employ a linker between the nucleobase 
and fluorescent dye, which results in a propargylamine resi- 
due after the cleavage of the dye. The modification of the 
nucleobase is an additional stimulus for triphosphate-main- 
tained phosphorolysis,^'^°^ thus resulting in truncated primers 
available for further extension. 

Altogether, we have demonstrated the feasibility of the 
template-directed incorporation of the 3'-terminated dye-la- 
belled nucleotides 2-5 that can be cleaved in one step to 
enable the next addition, thus providing the proof-of-princi- 
ple of a functional system also targeted to run in cyclic 
mode. 



Conclusion 

Herein, we have described the synthesis and full characteri- 
sation of four fluorescently labelled reversible terminators 
2-5. In these molecules, the 3'-blocking moiety and the 
linker-dye system are removable during the same fluoride- 
based deprotection treatment. The synthesised molecules 
are recognised and accepted by an identified DNA poly- 
merase. The reversible terminators and identified poly- 
merase, which was able to incorporate them, were also used 
in a cyclic reversibly terminating approach on CodeLink 
slides spotted with hairpin oligonucleotide probes. These re- 
sults demonstrate that the entire system is applicable in a 
cyclic reversibly terminating approach. Further studies that 
address the specificity and efficiency of the complete system 
are ongoing. 

Experimental Section 

Materials and methods: ^HNMR spectra were recorded on Bruker AM, 
DPX, and AV instruments at 250, 300, or 400 MHz and 300 K. ^^C spec- 
tra were recorded on Bruker AM, DPX, and AV instruments at 62.5, 75, 
100, or 150 MHz. The chemical shifts d in ^H and ^^^C NMR spectra are 
reported in ppm relative to the solvent signal. The fine structure of 
proton signals was specified by s (singlet), d (doublet), t (triplet), q (quar- 
tet), m (multiplet), dd (doublet of doublets), or br (broad) and in quota- 
tions for pseudofine structures. Assignments in the ^H and ^^CNMR 
spectra were made by DEFT, COSY, HSQC, and HMBC experiments. 
TLC analysis was carried out on polygram Sil G/UV254 by Macherey- 
Nagel & Co. (Dueren; thickness = 0.2 mm), 60 F254 (Merck KGaA, Darm- 
stadt; thickness = 0.2 mm), or RP-18W (Sigma-Aldrich Chemie GmbH, 
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Steinheim ; fluorescence indicator = 254 nm, thickness = 0.15 mm) . 
Column chromatography was carried out on silica gel 60 (Merck KGaA, 
Darmstadt; 40-63 jum) at normal pressure or on sihca gel 60 (Merck 
KGaA, Darmstadt; 15-40 jum) at a pressure of 2-3 bar (flash chromatog- 
raphy). FPLC was performed at 4°C on a Pharmacia FPLC system 
equipped with a single-path monitor UV-1 UV detector (1 = 254 nm) and 
self-packed columns of different sizes with diethylamnoethyl (DEAE) se- 
pharose material for ion-exchange FPLC ( Sigma- Aldrich; 0.05 m triethy- 
lammonium hydrogencarbonate (TEAB) buffer pH 8.0 (A)/0.8m TEAB 
buffer pH 8.0 (B) as the eluent) or octadecyl-functionalised silica gel 
(Sigma-Aldrich; water (A)/CH3CN (B) as the eluent) for RP-FPLC. RP- 
HPLC was performed on a Jasco LC-2000Plus HPLC system equipped 
with a Jasco UV 2075Plus detector (detection at 1 = 254 nm) and a Shi- 
madzu RF-353 fluorescence detector (excitation and emission at the spe- 
cific wavelength of the dye used). For the reversed-phase separation, 
Phenomenex Jupiter 4\i Proteo 90 A 4 jum columns (250 x 15 mm for 
preparative and 250 x 4.6 mm for analytical separations) were used with 
1 M triethylammonium acetate (TEAA) buffer pH 6.5 (A)/water (B)/ 
CH3CN (C) as the eluents. UV detection was accomplished at 1 = 
254 nm. Ion-exchange HPLC was performed on a Jasco LC-900 HPLC 
system equipped with a Jasco UV-970 detector (detection at 1 = 254 nm) 
and a Dionex BioLC DNAPac PA-100 column (250 x9 mm) with water/ 
0.25 M tris(hydroxymethyl)aminomethane hydrochloride (Tris-HCl) buffer 
(pH 8)/l M sodium chloride solution as the eluents. ESI mass-spectrome- 
try was performed on a Fisons instrument equipped with a VG platform 
II with quadrupol analyser. UV spectroscopy was performed on a Jasco 
V-650 spectrometer with 0.1 cm cuvettes. Fluorescence spectroscopy was 
performed on a Hitachi F4500 fluorescence spectrometer with 0.3 cm 
cuvettes. Elemental analyses were recorded on a Foss-Heraeus CHN-O 
Rapid instrument. The numbering of the atoms to assign the NMR signal 
are not related to the lUPAC numbering or the numbers used in the 
names of the compounds. Some signals in the ^H and ^^C NMR spectra of 
the triphosphate compounds are doubled, although this did not occur in 
the spectra of nucleoside derivatives 24-27. We assume that this outcome 
is due to the diastereoisomers that are obtained by introduction of the 
linker as a racemic mixture. These nucleoside derivatives are given in the 
i^C NMR data as X.XXA^.YY. 

General procedure for the Sonogashira coupling (GP2): In a heat-gun- 
dried Schlenk flask in an Ar atmosphere, the nucleoside was dissolved in 
dry DMF (0.1 mmolmL^) and triethylamine (5 equiv). The mixture was 
degassed by using the freeze/thaw technique for three times. After the re- 
action mixture had been warmed to room temperature, Cul (0.2 equiv) 
and [Pd(PPh3)4] (0.1 equiv) were added. In a separate heat-gun-dried 
Schlenk flask, alkyne 23 (1.2-2.0 equiv) was dissolved in dry DMF 
(250 juL) and added in two portions to the reaction mixture with a sy- 
ringe. One half portion was added immediately and the other half por- 
tion after 1 h. The reaction mixture was stirred for 3.5-5 h at room tem- 
perature in the dark. The reaction mixture was evaporated to dryness, 
the crude product redissolved in CH2CI2 (10 mL) and 5% ethylenediami- 
netetraacetate (EDTA; 5mL) solution, and stirred vigorously for lOmin. 
The layers were separated and the organic layer was washed with 5% 
EDTA solution (5mL), dried over sodium sulphate, and concentrated in 
vacuum. The residue was purified by flash-column chromatography. For 
example, the synthesis of 5-(prop-2-ynyl)carbamic acid-l-{2-[2-(2-ferr-bu- 
toxycarbonylaminoethoxy)ethoxy]ethoxymethyl)-2-cyanoethyl ester-3'-0- 
(2-cyanoethyl)-2'-deoxyuridine (24). 



23 




Following GP2, the reaction of 5-iodo-3'-0-(2-cyanoethyl)-2'-deoxyuri- 
dine (10; 250mg, 0.61 mmol, 1.0 equiv) with alkyne 23 (305 mg, 
0.74 mmol, 1.2 equiv) gave, after flash-column chromatography (CH2CI2/ 
MeOH 99:1^90:10), 24 as a slightly yellow oil (340mg, 80%). i?F=0.34 
(CH2Cl2/MeOH 90:10); ^HNMR (300 MHz, [D^DMSO, 300 K): (5 = 1.37 
(s, 23-H; 9H), 2.16-2.31 (m, 2'-H; 2H), 2.77 (t, 25-H; 2H), 2.88 (m, 12- 
H; 2H), 3.06 (q, 20-H; 2H), 3.37 (t, 19-H; 2H), 3.48-3.65 (m, 5'-H, 24-H, 
14-H, 15-H, 16-H, 17-H, 18-H; 7x2H), 3.93-3.97 (m, 4'-H; IH), 4.02 (d, 
9-H; 2H), 4.14 (quintet, 3'-H; IH), 4.95 (quintet, 11-H; IH), 5.13-5.19 
(m, 5'-OH; IH), 6.07 (t, I'-H; IH), 6.68-6.77 (m, 20-NH; IH), 7.93 (t, 9- 
NH; IH), 8.14 (s, 6-H; IH), 11.64 ppm (s, N^H, /9,9_nh = 5.7, /i9,2o = 
6.0Hz; IH); ^^CNMR (75 MHz, [DglDMSO, 300 K): (5 = 18.23 (25-C), 
19.72 (12-C), 28.21 (23-C), 30.68 (9-C), 36.90 (2'-C), 39.69 (20-C), 61.14 
(5'-C), 63.56 (24-C), 67.91 (11-C), 69.17 (19-C), 69.49, 69.60, 69.73, 70.21, 
70.30 (14-C, 15-C, 16-C, 17-C, 18-C), 74.49 (7-C), 77.58 (22-C), 79.08 (3'- 
C), 84.64 (r-C), 84.89 (4'-C), 89.57 (8-C), 98.23 (5-C), 117.64 (13-C), 
119.26 (26-C), 143.40 (2-C), 143.56 (6-C), 154.84 (10-C), 155.54 (21-C), 
161.49 ppm (4-C); ESI+-MS (m/z): calcd: 692.71 [C31H44N6O12] ; found: 
710.5 [M-\-¥L20]+, 715.4 [M-hNa]+; elemental analysis (%) calcd for 
C31H44N6O12: C 53.75, H 6.40, N 12.12; found: C 53.53, H 6.51, N 11.88. 
General procedure for the synthesis of A^-deprotected triphosphates 
(GP3): The A^-Boc protected nucleoside was vacuum dried in a Schlenk 
flask for 2 days, the flask was flushed with argon, and the nucleoside 
(0.1 mmol) dissolved in dry pyridine (100 |iL) and dry dioxane (300 |iL). 
A freshly prepared solution of 2-chloro-4//-l,2,3-benzodioxaphosphorin- 
4-one (1m, 1.15 equiv) in dry dioxane was added with a syringe and the 
reaction mixture stirred for 15 min at room temperature. Dry DMF 
(150 |llL), bis(tributylammonium)pyrophosphate (1m, 1.5 equiv), and trib- 
utylamine (1 m, 4.35 equiv) were added simultaneously with syringes. The 
reaction mixture was stirred for 20 min before a solution of iodine in pyr- 
idine/water (2%, 98:2) was added (1 mL). The reaction mixture was left 
for a further 20 min. Afterwards, the excess of iodine was quenched with 
5% Na2S03 solution, and the mixture was evaporated to dryness. The 
crude triphosphate derivatives (0.1 mmol) were dissolved in water (5 mL) 
containing TFA (40 equiv). The solution was stirred for 5-6 h at room 
temperature and subsequently concentrated in vacuum. For the ion-ex- 
change FPLC purification, the deprotected triphosphate derivatives were 
dissolved in Millipore water (8-12 mL) and filtered through a 0.45 |im sy- 
ringe filter. The following gradient was used for the separation at a flow 
rate of 4mLmin-i: 0% B (OmL)^50% B (500mL)^100% B 
(650 mL). The combined fractions were lyophilised, dissolved in 1-2 mL 
of Millipore water, and filtered through a 0.45 jum syringe filter before 
the second purification step by preparative RP-HPLC. The following gra- 
dient was used at a flow rate of 6mLmin~^: Omin: 10:90:0 A/B/C, 
15 min: 10:60:30 A/B/C; 16 min: 0:0:100 A/B/C; 22 min: 0:0:100 A/B/C; 
23 min: 10:90 A/B/C, 30 min: 10:90:0 A/B/C. The final yields were calcu- 
lated from the ^H NMR spectrum of the HPLC purified triphosphate de- 
rivatives because the products contained traces of triethylammonium ace- 
tate after RP-HPLC purification (for example, the synthesis of 28). 
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Following GP3, the reaction of the A/^-Boc-protected nucleoside 24 
(500 mg, 0.72 mmol, 1.0 equiv) yielded triphosphate 28 as a colourless oil 
(140 mg, 23%). Rp = 039 (/Pr/H20/EtMe2N 6:3.8:0.2); lE-FPLC: elution 
concentration = 0.29m TEAB/buffer (32% B); RP-HPLC: retention 
time = 14.92 min; ^HNMR (300 MHz, D2O, 300 K): ^ = 2.32 (ddd, 2'-H; 
IH), 2.55 (ddd, 2'-H'; IH), 2.81 (t, 22-H; 2H), 2.89-2.99 (m, 12-H; 2H), 
3.25 (m, 20-H; 2H), 3.67-3.79 (m, 14-H, 15-H, 16-H, 17-H, 18-H, 19-H; 
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6x2H), 3.84 (t, 21-H; 2H), 4.12-4.26 (m, 5'-H, 9-H; 2x2H), 4.37 (brs, 
4'-H; IH), 4.45-4.49 (m, 3'-H; IH), 5.12-5.23 (m, 11-H; IH), 6.26 (dd, 
I'-H; IH), 8.17 ppm (s, 6-H, /2;2" = 14.5, Jv,2' = ^A A',2" = 5.7, /2i,22 = 
6.0 Hz; IH); ^^CNMR (75 MHz, DA 300 K): (5 = 18.91 (22-C), 20.21 
(12-C), 31.49 (9-C), 37.28/37.30 (2'-C), 39.67 (21-C), 64.43 (21-C), 66.31 
(d, 5'-C), 66.95, 70.09, 70.12, 70.19, 70.84, 71.18 (14-C, 15-C, 16-C, 17-C, 
18-C, 19-C), 69.30 (11-C), 73.98/74.01 (7-C), 80.45/80.47 (3'-C), 84.31 (d, 
4'-C), 86.50 (I'-C), 90.93 (8-C), 99.77 (5-C), 118.78 (13-C), 120.45 (23-C), 
145.34/145.35 (6-C), 151.03 (2-C), 157.27/157.29 (10-C), 164.99 ppm (4-C, 
/4,^_p = 8.82, /5-„ p = 5.60Hz); ^^P NMR (121 MHz, D2O, 300 K): 6 = 
-10.97 (d, Y-P), -11-69 (d, a-P), -23.43 ppm (t, |3-P); ESI -MS (m/z): 
calcd for [C26H39N6O19P3]: 832.54; found: 831.4 [M-H]", 849.6 [M-H + 
H2O]-. 

Synthesis of 2: KHCO3 (3mg, 30 jumol, 2.0 equiv) was added to a solu- 
tion of triphosphate 28 (12 mg, 14 |Lunol, 1.0 equiv) in dry DMF (500 |llL). 
The reaction mixture was cooled to 0°C, and 5- and 6-carboxyfluores- 



cein-(A^-hydroxysuccinimidyl)ester (10 mg, 20 jumol, 1.5 equiv) was dis- 
solved in dry DMF (250 |llL) and added in two portions to the reaction 
mixture, one half immediately and the other after Ih at 0°C. The reac- 
tion mixture was stirred for 20 h at 0 °C. After evaporation of the solvent, 
the crude product was purified by RP-FPLC at 4 °C and preparative RP- 
HPLC (see GP4 in the Supporting Information). The triethyl ammonium 
salt of the labelled triphosphate 2 was obtained as a yellow sohd (6mg, 
34%) as a mixture of the two regioisomers, each consisting of two diaste- 
reomers. i?F = 0.65 (H2O/CH3CN 80:20); RP-FPLC: elution concentra- 
tion =2.5 ^6% B; RP-HPLC: retention time = 16.49 min isomer A, 
16.90 min isomer B (UV detection), 16.53 min isomer A, 16.94 min 
isomer B (fluorescence detection); UV absorption: 1 = 494, 277, 235 nm; 
fluorescence: X,, = 492, lem = 512nm; ^HNMR (250 MHz, D2O, 300 K): 
(5 = 2.15-2.37 (m, 2'-H; IH), 2.44-2.59 (m, 2'-H'; IH), 2.81-2.93 (m, 12-H, 
23-H; 2x2H), 3.57-3.91 (m, 14-H, 15-H, 16-H, 17-H, 18-H, 19-H, 20-H, 
22-H; 8x2H), 4.00 (brs, 9-H; 2H), 4.10-4.32 (m, 4'-H, 5'-H; 3H), 4.10- 
4.32 (m, 3'-H; IH), 4.99-5.14 (m, 11-H; IH), 6.04-6.25 (m, I'-H; IH), 
6.69-7.04 (m, fluorescein-H ; 4H), 7.09-7.32 (m, fluorescein-H ; 2H), 
7.56-8.46 ppm (m, 6-H, fluorescein-H; 5H); ^^PNMR (121 MHz, D2O, 
300 K): (5 = -10.63 (d, y-P), -11.56 (dd, a-P), -23.09 ppm (t, P-P, /„,^ = 
19.6, /y,^ = 19.6Hz); ESE-MS (m/z): calcd for [C47H49N6O25P3]: 1190.85; 
found: 594.6 [M-2Hf-, 1110.8 [M-PO3-H]-, 554.8 [M-FOs-lHf-, 
1190.4 [M-H]-. 

Solution-based primer-extension experiments: The oligodeoxynucleotides 
(shown in Figure 2 A) were used to form the DNA substrate for the ex- 
tension and termination experiments using 3'-0-(2-cyanoethyl)-modified 
nucleotides and an extension experiment using fluorescence-labelled re- 
versible terminators 2-5. The primer strand was 5'-radiolabelled and an- 
nealed to the complementary (template) strand by heating for 5 min to 
95 °C and gradually cooling to room temperature over 2h. For the 
primer extension, 10 um DNA duplex, polymerases in at least 10-fold 
molar excess, and 50 |lim of the 3'-modified nucleotides was used. The 3'- 
modified nucleotides were additionally purified by enzymatic depletion 
of the natural nucleotide counterpart before use.^^^^ The reactions were 
performed for 60 min at 37 °C in a total volume of the reaction mixture 
of 20 |iL, containing 33 mM Tris-acetate (pH7.9, at 37 °C), 10 mM magne- 
sium acetate, 66 mM potassium acetate, and BSA (O.lmgmL"^). After 
completion of the reaction, an aliquot of the reaction mixture was supple- 
mented by dTTP (up to 50 |um), and the reaction was allowed to proceed 



for additional 5 min at the same temperature. The reactions were stopped 
by adding an equal volume of the STOP solution: 95% formamide and 
100 mM EDTA. The products were resolved by using a denaturing poly- 
acrylamide (PAA) gel (7m urea; 15% 29:1). The gel was dried on What- 
man paper and autoradiographed by using a Fuji phosphorimager screen. 
For the evaluation of the polymerase specificity, the four substrate sys- 
tems were used (Figure 3 B). Concentrations of 5 um of each DNA 
duplex and 20U|liL-^ of the Revert Aid M-MuLV reverse transcriptase 
were used. The reaction buffer was the same as described above with the 
addition of MnCl2 (ImM) and dithiothreitol (DTT; 5mM). The reactions 
were performed for 5 min at 37 °C, then an aliquot of the reaction mix- 
ture was supplemented with all four natural dNTP derivatives (up to 
100 |iM each). The reaction was allowed to proceed for additional 5 min 
at the same temperature. Samples were analysed as described above. 
Spotting of DNA arrays and APEX reactions on CodeLink arrays with 
fluorescently labelled 3'-0-(2-cyanoethyl)-dNTP derivatives. All oUgonu- 

cleotide primers were diluted to a con- 
centration of 10 |iM in carbonate 
buffer (100 mM, pH9.0) and spotted 
onto CodeLink-activated slides from 
SurModics (Eden Prairie, MN, USA) 
with a Versarray Chipwriter Pro Ar- 
rayer (BioRad Laboratories, Hercules, 
CA, USA) as 8x8-probe squares ac- 
cording to the scheme shown in 
Figure 4. The oligonucleotide primers 
were immobilised onto the slide sur- 
face and the slides blocked before the 
APEX reactions, as suggested by the 
vendor. APEX reaction cycles on Co- 
deLink arrays consisted of the following steps: 1) Blocking of DNA 
arrays with blocking solution (50 |iL, Tris-acetate (33 mM, pH 7.9 at 
37 °C); magnesium acetate (10 mM), potassium acetate (66 mM), BSA 
(ImgmL^), DTT (lOmM), Tween-20 (1%), NP-40 (1%)) for lOmin at 
37 °C. 2) The first primer-extension reaction with 2-5 was carried out by 
treating the slide with the set of the fluorescently labelled reversible ter- 
minators 2-5 and RevertaAid M-MuLV reverse transcriptase at 37 °C for 
30 min. An aliquot of the reaction mixture (35 |iL) was used for one slide 
containing Tris-acetate (33 mM, pH 7.9 at 37 °C), magnesium acetate 
(lOmM), potassium acetate (66 mM), BSA (O.lmgmL^), MnCl2 (ImM), 
DTT (10 mM), Tween-20 (0.5%), Nonidet P-40 (0.5%), 2-hydroxyme- 
thyl[18]crown-6 (1%), 2-5 (100 |um each), and RevertAid M-MuLV re- 
verse transcriptase (10 UjuL"^). 3) Blocking of the starting-material oUgo- 
nucleotide primers on the array was accomphshed with TdT (MBI Fer- 
mentas, Vilnius, Lithuania) and ddATP by incubating the arrays for 
20 min at 37 °C. The TdT reaction mixture consisted of the reaction mix- 
ture with an aliquot (40 |iL) of potassium cacodylate (200 mM), Tris 
(25 mM), Triton X-100 (0.01%, v/v), C0CI2 (ImM, pH 7.2 at 25 °C), 
ddATP (ImM), and TdT enzyme (80 U). 4) The reactions were stopped 
by washing with deionized water at 95 °C. The slides were dried, covered 
with a droplet of Slowfade light antifade reagent (Molecular Probes, OR, 
USA), and imaged with Genorama Quattrolmager detector (Asper Bio- 
tech Ltd.). The fluorescence signal intensities were extracted and ana- 
lysed with BaseCaller module of the Genorama genotyping software 
package (Asper Biotech Ltd.). 5) Cleavage of the fluorescence dye and 
the 3'-0-(2-cyanoethyl) group from the incorporated fluorescence la- 
belled 3'-0-(2-cyanoethyl)-blocked nucleotides was accomplished with 
TBAF (1m) in THE Before the unblocking reaction, the DNA arrays 
were washed three times with THE The unblocking reaction was carried 
out by immersing the DNA arrays for 15 min at 45 °C into a 1:1 solution 
of TBAF/THF (Fluka, Buchs, Switzerland) and dry DMF 6) The DNA 
arrays were washed with deionized water, dried, and read with the Gen- 
orama Quattrolmager detector to be sure that the unblocking reaction 
was successful. 7) Before the second primer-extension reaction, the slides 
were blocked again with blocking solution (see above, step 1) for 10 min 
at 37 °C. 8) The second primer-extension reaction using the fluorescently 
labelled 3'-0-(2-cyanoethyl)-dNTP derivatives 2-5 and the reverse tran- 
scriptase was carried out under the reaction conditions described for the 
first extension step (step 2). 9) The reactions were stopped by washing 
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with deionized water at 95 °C. The shdes were dried, read, and analysed 
as described above for first primer-extension reaction. 
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